Abstract. The Boltzmann-Gibbs probability distribution, seen as a statistical model, belongs
to the exponential family. Recently, the latter concept has been generalized. The q-exponential
family has been shown to be relevant for the statistical description of small isolated systems.
Two main applications are reviewed: 1. The distribution of the momentum of a single particle
is a q-Gaussian, the distribution of its velocity is a deformed Maxwellian; 2. The configurational
density distribution belongs to the q-exponential family.

The definition of the temperature of small isolated systems is discussed. It depends on
defining the thermodynamic entropy of a microcanonical ensemble in a suitable manner. The
simple example of non-interacting harmonic oscillators shows that Rényi's entropy functional
leads to acceptable results.



1. Introduction 

In statistics, a model is a probability distribution which depends on a number of parameters. The 
probability distributions of statistical physics usually depend on parameters and can therefore 
be seen as statistical models. Typical parameters are the total energy U in the microcanonical 
ensemble, or the inverse temperature $\beta$ in the canonical ensemble. This parameter dependence
is important in order to understand why certain models belong to the exponential family and
others do not. In particular, all models described by a Boltzmann-Gibbs distribution belong to
the exponential family because they have the right dependence on the inverse temperature $\beta$.

Recently, the notion of the exponential family has been generalized by the present author in a
series of papers [1, 2, 3, 4, 5, 6]. The same definition of generalized exponential family has been
introduced in the mathematics literature [U B [H El E]. This class of models was also derived
using the maximum entropy principle in [10, 11] and in the context of game theory [12, 13]. Here
we concentrate on the more specific notion of the q-exponential family and review the evidence
that this family appears in a natural manner in the context of the microcanonical description
of a classical gas.

The notion of q-exponential family is connected with Amari's $\alpha$-family [J3], studied in the
context of information geometry. The geometric approach is very appealing also in the context
of statistical physics. See for instance [15, 16]. However, this topic will not be discussed in the
present paper.

Note that in many of the formulas given below Boltzmann's constant $k_B$ is set equal to 1,
which means that temperatures are measured in units of energy.



2. The Boltzmann-Gibbs distribution 

The Boltzmann-Gibbs (BG) distribution is given by 



$$ f_\beta(x) = \frac{c(x)}{Z(\beta)}\, e^{-\beta H(x)}, \qquad x \text{ in phase space.} \tag{1} $$

The Hamiltonian $H(x)$ is a given function. It determines the physical model. $\beta$ is a parameter,
corresponding with the inverse temperature of thermodynamics. The function $c(x)$ quite often
is just a constant. $Z(\beta)$ is called the partition sum. It normalizes the probability distribution:

$$ Z(\beta) = \int dx\, c(x)\, e^{-\beta H(x)}. \tag{2} $$



$\Phi(\beta) = \ln Z(\beta)$ corresponds in thermodynamics with Massieu's function. It satisfies the identity

$$ \frac{d\Phi}{d\beta} = -U, \tag{3} $$

with $U$ the energy, that is, the expected value of the Hamiltonian $H(x)$,

$$ U = \mathbb{E}_\beta H = \int dx\, f_\beta(x) H(x). \tag{4} $$

The second derivative of Massieu's function satisfies

$$ \frac{d^2\Phi}{d\beta^2} = \mathbb{E}_\beta (H - U)^2 \ge 0. \tag{5} $$

Therefore, $\Phi(\beta)$ is a convex function, as expected from thermodynamics. Indeed, in
thermodynamics Massieu's function is by definition [17] the Legendre transform of the thermodynamic
entropy $S(U)$, which is a function of the energy $U$. Within statistical physics the thermodynamic
entropy can be defined by means of an entropy functional $I(f)$, which is a function over the space
of all probability distributions $f$. The most common entropy functional is that of
Boltzmann-Gibbs-Shannon (BGS),

$$ I(f) = -\int dx\, f(x) \ln \frac{f(x)}{c(x)}. \tag{6} $$

It is well known that $I(f)$ takes its maximal value at $f = f_\beta$ within the set of all $f$ satisfying
$\mathbb{E}_f H = U$. This maximal value of $I(f)$ is then identified with the thermodynamic entropy $S(U)$.
It satisfies

$$ S(U) = \max_f \{ I(f) : \mathbb{E}_f H = U \} = I(f_\beta) = \int dx\, f_\beta(x) \left( \ln Z(\beta) + \beta H(x) \right) = \ln Z(\beta) + \beta U. \tag{7} $$

Comparison with

$$ \Phi(\beta) = \max_U \{ S(U) - \beta U \} \tag{8} $$

then yields $\Phi(\beta) = \ln Z(\beta)$. From (7, 8) follows the inverse transformation

$$ S(U) = \inf_\beta \{ \Phi(\beta) + \beta U \} \tag{9} $$

and an identity, which is the dual expression of (3),

$$ \frac{dS}{dU} = \beta. \tag{10} $$

The latter relation is often used as the definition of the inverse temperature. Another
consequence of (9) is that $S(U)$ is a concave function.
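
The identities (3), (5) and (7) can be checked numerically on a small discrete system. The following is a minimal sketch, assuming a hypothetical three-level spectrum with $c(x) = 1$; the energy levels, the value of $\beta$ and the finite-difference step are illustrative choices, not taken from the text.

```python
import math

def massieu(beta, energies):
    """Phi(beta) = ln Z(beta) for a discrete spectrum with c(x) = 1."""
    return math.log(sum(math.exp(-beta * e) for e in energies))

def mean_and_variance(beta, energies):
    """Return U = E_beta[H] and E_beta[(H - U)^2] under the Boltzmann-Gibbs weights."""
    weights = [math.exp(-beta * e) for e in energies]
    z = sum(weights)
    u = sum(w * e for w, e in zip(weights, energies)) / z
    var = sum(w * (e - u) ** 2 for w, e in zip(weights, energies)) / z
    return u, var

energies = [0.0, 1.0, 2.5]   # hypothetical three-level spectrum
beta, h = 0.7, 1e-4          # illustrative inverse temperature and step size

u, var = mean_and_variance(beta, energies)

# Central finite differences approximate the first and second derivative of Phi.
d1 = (massieu(beta + h, energies) - massieu(beta - h, energies)) / (2 * h)
d2 = (massieu(beta + h, energies) - 2 * massieu(beta, energies)
      + massieu(beta - h, energies)) / h ** 2

# Entropy according to (7): S = ln Z + beta * U.
entropy = massieu(beta, energies) + beta * u

print("dPhi/dbeta   =", d1, " expected -U       =", -u)
print("d2Phi/dbeta2 =", d2, " expected variance =", var)
print("S(U)         =", entropy)
```

The first derivative should agree with $-U$ and the second with the variance of $H$, which is positive, in line with the convexity of $\Phi(\beta)$.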



3. The exponential family 

A model of classical (statistical) mechanics is determined by a Hamiltonian $H(x)$. On the
other hand, a statistical model is defined as a parameterized probability distribution function
(pdf). It is now immediately clear that the Boltzmann-Gibbs distribution (1) defines a statistical
model with one parameter $\beta$. Moreover, it belongs to a very special class of models called
the exponential family. In fact, a model belongs to the exponential family with parameters
$\theta = (\theta_1, \theta_2, \dots, \theta_n)$ when its pdf $f_\theta(x)$ can be written in the canonical form

$$ f_\theta(x) = c(x) \exp\left( \sum_{j=1}^n \theta_j K_j(x) - \alpha(\theta) \right), \tag{11} $$

where $c(x)$, $K_j(x)$, and $\alpha(\theta)$ are known functions.

Models belonging to the exponential family share a number of nice properties. For instance,
they satisfy the identities, which generalize (3),

$$ \frac{\partial\alpha}{\partial\theta_j} = \mathbb{E}_\theta K_j = \int dx\, f_\theta(x) K_j(x). \tag{12} $$

Note that the functions $K_j(x)$ may not depend on the parameters $\theta$. In statistical physics one
sometimes uses effective Hamiltonians which depend on the inverse temperature $\beta$. The corresponding
statistical models may exhibit thermodynamic instabilities which cannot occur in models
belonging to the exponential family.



4. Deformed logarithmic and exponential functions 

The Boltzmann-Gibbs distribution is so dominant in statistical physics that one can get the
impression that this is the only relevant statistical model. However, there exists a much larger
class of models, sharing most of the nice properties of the exponential family. Some of these
more general models appear in a natural manner in statistical physics, as will be shown further
on. One way to get access to this larger class is by replacing the exponential function in the
definition (11) by some other function. This can be done in a fairly general way [HE]. Here,
a one-parameter deformation is considered, which is the basis for the generalization known as
non-extensive statistical mechanics [18].

Fix a real number $q \neq 1$. The q-deformed logarithm is defined by [19, 20]

$$ \ln_q(u) = \frac{1}{1-q} \left( u^{1-q} - 1 \right), \qquad u > 0. \tag{13} $$

The basic properties of the q-deformed logarithmic function are that $\ln_q(1) = 0$ and that it is a
strictly increasing function. Indeed, its first derivative is always positive:

$$ \frac{d}{du} \ln_q(u) = \frac{1}{u^q} > 0. \tag{14} $$

For $q > 0$ the q-logarithm is a concave function. In the limit $q = 1$ it reduces to the natural
logarithm $\ln u$.

The inverse function is the deformed exponential function

$$ \exp_q(u) = \left[ 1 + (1-q)u \right]_+^{1/(1-q)}. \tag{15} $$

The notation $[u]_+ = \max\{0, u\}$ is used. One has $0 \le \exp_q(u) \le +\infty$ for all $u$. For $q \neq 1$ the
range of $\ln_q(u)$ is not the full line. By putting $\exp_q(u) = 0$ when $u$ is below the range of $\ln_q(u)$,
and equal to $+\infty$ when it is above, $\exp_q(u)$ is an increasing function of $u$, defined for all values
of $u$. With this extended definition one has

$$ \exp_q(\ln_q(u)) = u \qquad \text{for all } u > 0. \tag{16} $$

However, the relation $\ln_q(\exp_q(u)) = u$ is only defined when $\exp_q(u)$ is neither 0 nor $+\infty$.

Figure 1. The q-deformed logarithm $\ln_q(u)$ for q-values 0.5 (red), 1 (black), and 2 (blue).

Figure 2. The q-deformed exponential $\exp_q(u)$ for q-values 0.5 (red), 1 (black), and 2 (blue).

The basic properties of the q-deformed exponential are that $\exp_q(0) = 1$ and that it is an
increasing function. Indeed, one has

$$ \frac{d}{du} \exp_q(u) = \left[ \exp_q(u) \right]^q \ge 0. \tag{17} $$

For $q > 0$ it is a convex function.
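
The definitions (13) and (15), including the extension of the deformed exponential by 0 and $+\infty$ outside the range of the deformed logarithm, translate directly into code. The sketch below is only an illustration; the function names `ln_q` and `exp_q` and the treatment of $q = 1$ as a special case are choices made here.

```python
import math

def ln_q(u, q):
    """q-deformed logarithm (13); the natural logarithm is used for q == 1."""
    if u <= 0:
        raise ValueError("ln_q is defined for u > 0 only")
    if q == 1:
        return math.log(u)
    return (u ** (1 - q) - 1) / (1 - q)

def exp_q(u, q):
    """q-deformed exponential (15), extended to all real u."""
    if q == 1:
        return math.exp(u)
    base = 1 + (1 - q) * u
    if base <= 0:
        # q < 1: u lies below the range of ln_q, so the value is 0.
        # q > 1: u lies above the range of ln_q, so the value is +infinity.
        return 0.0 if q < 1 else math.inf
    return base ** (1 / (1 - q))

for q in (0.5, 1, 2):
    # Round trip (16): exp_q(ln_q(u)) should return u for u > 0.
    print(q, exp_q(ln_q(3.0, q), q))

print(exp_q(-3.0, 0.5))  # below the range of ln_q for q = 0.5
print(exp_q(2.0, 2.0))   # above the range of ln_q for q = 2
```

With these conventions `exp_q` is defined on the whole line and non-decreasing, while `ln_q(exp_q(u, q), q)` only makes sense when `exp_q(u, q)` is finite and non-zero.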


5. The q-exponential family

The deformed logarithmic and exponential functions can be used in more than one way to develop
a generalized formalism of statistical physics. The original approach of non-extensive statistical
physics [21] was to first generalize the Boltzmann-Gibbs-Shannon entropy functional (6) by
deforming the logarithm contained in it. Next one studies pdfs which maximize this entropy
functional under given constraints. However, it turns out to be advantageous to generalize
immediately the notion of exponential family by deforming the exponential function appearing
in (11). This leads to the definition that the one-parameter model $f_\theta(x)$ belongs to the
q-exponential family if there exist functions $c(x)$, $\alpha(\theta)$, $H(x)$ such that one can write

$$ f_\theta(x) = c(x) \exp_q\left( -\alpha(\theta) - \theta H(x) \right). \tag{18} $$

In the limit $q = 1$ this reduces to the standard definition of the exponential family with $\theta = \beta$.
Note that we take into account the possibility that the parameter $\theta$ does not necessarily coincide
with the inverse temperature $\beta$ as it is defined by the thermodynamic relation (10).
Introduce the q-deformed entropy functional

$$ I_q(f) = -\int dx\, f(x) \ln_q \frac{f(x)}{c(x)} = \frac{1}{1-q} \left( 1 - \int dx\, c(x) \left( \frac{f(x)}{c(x)} \right)^{2-q} \right). \tag{19} $$

In the limit $q = 1$ it reduces to the BGS entropy functional (6). Note that this definition is
almost the entropy functional originally introduced by Tsallis [21], but differs from it by the fact
that the parameter $q$ has been substituted by $2 - q$.
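
The two expressions in (19), and the relation with Tsallis' functional under the substitution of $q$ by $2 - q$, can be illustrated on a discrete distribution. This is a minimal sketch assuming $c = 1$ and sums in place of integrals; the probabilities and the value of $q$ are arbitrary illustrative choices.

```python
import math

def ln_q(u, q):
    """q-deformed logarithm (13), with the natural logarithm as the q = 1 case."""
    return math.log(u) if q == 1 else (u ** (1 - q) - 1) / (1 - q)

def entropy_q(probs, q):
    """Discrete analogue of the first expression in (19) with c = 1."""
    return -sum(p * ln_q(p, q) for p in probs if p > 0)

def entropy_q_closed(probs, q):
    """Discrete analogue of the second expression in (19), for q != 1."""
    return (1 - sum(p ** (2 - q) for p in probs if p > 0)) / (1 - q)

def tsallis(probs, q_t):
    """Tsallis' functional (1 - sum_i p_i^q_t) / (q_t - 1), for q_t != 1."""
    return (1 - sum(p ** q_t for p in probs if p > 0)) / (q_t - 1)

probs = [0.5, 0.3, 0.2]  # hypothetical distribution
q = 0.5                  # illustrative deformation parameter

print(entropy_q(probs, q))
print(entropy_q_closed(probs, q))
print(tsallis(probs, 2 - q))   # Tsallis' functional with parameter 2 - q
print(entropy_q(probs, 1))     # q = 1: Boltzmann-Gibbs-Shannon value
```

The first three printed values should coincide, which is the discrete counterpart of the remark that $I_q$ is Tsallis' functional with $q$ replaced by $2 - q$.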

Assume now $0 < q < 2$. Then one can show that the pdf $f = f_\theta(x)$ maximizes the quantity

$$ \frac{1}{2-q} I_q(f) - \theta\, \mathbb{E}_f H. \tag{20} $$

In the $q = 1$ case this result is known as the variational principle. A proof of this statement
involves the introduction of an appropriate Bregman divergence; see [1]. Note that the
variational principle implies the maximum entropy principle: $I_q(f_\theta) \ge I_q(f)$ for all $f$ which
satisfy $\mathbb{E}_f H = \mathbb{E}_\theta H$ (using the notation $\mathbb{E}_\theta = \mathbb{E}_{f_\theta}$). A well-known problem is now that $f_\theta$ also
maximizes $\xi(I_q(f))$, where $\xi(u)$ is an arbitrary monotonically increasing function. Therefore a
meaningful ansatz is to assume that the thermodynamic entropy $S(U)$ is given by

$$ S(U) = \xi\left( I_q(f_\theta) \right) = \xi\left( \alpha(\theta) + \theta\, \mathbb{E}_\theta H \right). \tag{21} $$

The thermodynamic expression for the inverse temperature $\beta$ can then be calculated using (10).
It is clear that the resulting relation between energy $U$ and temperature will depend on the
choice of the monotonic function $\xi(u)$. Physical arguments are needed to decide which choice is
the right one.

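6. q-Gaussians

The q-Gaussian distribution in one variable is given by (see for instance [22, 23, 24])

$$ f(x) = \frac{1}{c_q \sigma} \exp_q\left( -\frac{x^2}{\sigma^2} \right), \tag{22} $$

with

$$ c_q = \int_{-\infty}^{+\infty} dx\, \exp_q(-x^2) = \begin{cases} \sqrt{\dfrac{\pi}{q-1}}\, \dfrac{\Gamma\left( -\frac{1}{2} + \frac{1}{q-1} \right)}{\Gamma\left( \frac{1}{q-1} \right)} & \text{if } 1 < q < 3, \\[2ex] \sqrt{\dfrac{\pi}{1-q}}\, \dfrac{\Gamma\left( 1 + \frac{1}{1-q} \right)}{\Gamma\left( \frac{3}{2} + \frac{1}{1-q} \right)} & \text{if } q < 1. \end{cases} \tag{23} $$

It can be brought into the form (18) with $c(x) = 1/c_q$, $H(x) = x^2$, $\theta = \sigma^{q-3}$, and

$$ \alpha(\theta) = \frac{\sigma^{q-1} - 1}{q - 1} = \ln_{2-q}(\sigma). \tag{24} $$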



The $q = 1$ case reproduces the conventional Gauss distribution. For $q < 1$ the distribution
vanishes outside an interval. Take for instance $q = 1/2$. Then (22) becomes

$$ f(x) = \frac{15\sqrt{2}}{32\sigma} \left[ 1 - \frac{x^2}{2\sigma^2} \right]_+^2. \tag{25} $$

This distribution vanishes outside the interval $[-\sqrt{2}\sigma, \sqrt{2}\sigma]$. In the range $1 < q < 3$ the
q-Gaussian is strictly positive on the whole line and decays with a power law in $|x|$ instead of
exponentially. For $q = 2$ one obtains

$$ f(x) = \frac{1}{\pi} \frac{\sigma}{x^2 + \sigma^2}. \tag{26} $$

This is the Cauchy distribution. The function (26) is also called a Lorentzian and is often used
in physics to fit the shape of spectral lines. For $q \ge 3$ the distribution cannot be normalized
because

$$ f(x) \sim |x|^{-2/(q-1)} \qquad \text{as } |x| \to \infty \tag{27} $$

is not integrable anymore.
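
The normalization constant (23) and the special cases (25) and (26) can be cross-checked numerically. The sketch below is an illustration only: it evaluates the Gamma-function ratios through `math.lgamma` to avoid overflow near $q = 1$, and it uses a simple midpoint rule (an arbitrary choice made here) to integrate the compactly supported $q = 1/2$ density.

```python
import math

def exp_q(u, q):
    """q-deformed exponential (15), extended by 0 and +infinity."""
    if q == 1:
        return math.exp(u)
    base = 1 + (1 - q) * u
    if base <= 0:
        return 0.0 if q < 1 else math.inf
    return base ** (1 / (1 - q))

def c_q(q):
    """Normalization constant (23), valid for q < 1 or 1 < q < 3."""
    if 1 < q < 3:
        s = 1 / (q - 1)
        log_ratio = math.lgamma(s - 0.5) - math.lgamma(s)
        return math.sqrt(math.pi / (q - 1)) * math.exp(log_ratio)
    if q < 1:
        s = 1 / (1 - q)
        log_ratio = math.lgamma(1 + s) - math.lgamma(1.5 + s)
        return math.sqrt(math.pi / (1 - q)) * math.exp(log_ratio)
    raise ValueError("(23) covers q < 1 and 1 < q < 3 only")

def q_gaussian(x, q, sigma=1.0):
    """q-Gaussian density (22)."""
    return exp_q(-(x / sigma) ** 2, q) / (c_q(q) * sigma)

def integrate(f, a, b, n=20000):
    """Midpoint rule on [a, b] with n subintervals."""
    step = (b - a) / n
    return step * sum(f(a + (i + 0.5) * step) for i in range(n))

# q = 1/2 with sigma = 1: support is [-sqrt(2), sqrt(2)], total mass should be 1.
total = integrate(lambda x: q_gaussian(x, 0.5), -math.sqrt(2), math.sqrt(2))
print("mass for q = 1/2:", total)
print("prefactor 1/c_q for q = 1/2:", 1 / c_q(0.5), "vs", 15 * math.sqrt(2) / 32)
print("c_q for q = 2:", c_q(2.0), "vs pi =", math.pi)
```

For $q = 2$ the constant reduces to $\pi$, which is the normalization of the Cauchy distribution (26), and for $q = 1/2$ the prefactor agrees with (25).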




7. A q-deformed Maxwell distribution

The most straightforward way to produce an important example of q-Gaussians in statistical
physics is by considering marginals of the uniform distribution on a hypersphere; see [25]. The
Hamiltonian of an N-particle classical ideal gas is given by

$$ H(p) = \frac{1}{2m} \sum_{j=1}^N |p_j|^2, \tag{28} $$

where $m$ is the mass of the particles and $p_j$ is the momentum of the j-th particle. Given the
value $E$ for the total energy, the phase space consists of all points of the 3N-dimensional sphere
with radius $\sqrt{2mE}$. Assume equal probability of all points of phase space. Let $B_n(r)$ denote the
volume of a sphere with radius $r$ in dimension $n$. The probability distribution for the momentum
of a single particle becomes

$$ f(p_1) \propto \left[ 1 - \frac{|p_1|^2}{2mE} \right]_+^{(3N-4)/2} = \exp_q\left( -\frac{|p_1|^2}{2\sigma^2} \right) \tag{29} $$

with

$$ q = \frac{3N-6}{3N-4} \qquad \text{and} \qquad \sigma^2 = \frac{2mE}{3N-4}. \tag{30} $$

This is a q-Gaussian with $q < 1$. The appearance of a cutoff can be easily understood.
Arbitrarily large momenta are not possible because of the obvious upper bound $|p_1|^2 \le 2mE$.
The probability distribution of the scalar velocity $v = |p_1|/m$ becomes

$$ f_q(v) \propto v^2 \exp_q\left( -\frac{m^2 v^2}{2\sigma^2} \right). \tag{31} $$

Only in the limit of large systems does it converge to the Maxwell distribution. One concludes that
the Maxwell distribution is an approximation and is only valid in the limit $N \to \infty$.
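
The approach to the Maxwell distribution can be made visible by comparing the unnormalized q-Gaussian shape with a Gaussian for growing $N$. The sketch below is an illustration under an extra assumption not made in the text: the total energy is taken proportional to $N$, $E = \frac{3}{2} N k_B T$, so that a fixed temperature can be attached to the Gaussian; the numerical values are arbitrary.

```python
import math

def deformed_shape(p, n_particles, m=1.0, energy_per_particle=1.5):
    """Unnormalized exp_q(-|p|^2 / (2 sigma^2)) with q and sigma^2 as in (30).

    Uses 1/(1-q) = (3N-4)/2 and (1-q)/(2 sigma^2) = 1/(2 m E).
    Assumes N >= 2 and E = energy_per_particle * N (illustrative assumption).
    """
    total_energy = energy_per_particle * n_particles
    exponent = (3 * n_particles - 4) / 2
    base = 1 - p * p / (2 * m * total_energy)
    return max(base, 0.0) ** exponent

def maxwell_shape(p, m=1.0, energy_per_particle=1.5):
    """Unnormalized Gaussian exp(-|p|^2 / (2 m kT)) with kT = (2/3) E/N."""
    kt = energy_per_particle / 1.5
    return math.exp(-p * p / (2 * m * kt))

p = 1.0
errors = {n: abs(deformed_shape(p, n) - maxwell_shape(p)) for n in (2, 10, 1000)}
for n, err in errors.items():
    print(n, err)

# Cutoff: for N = 2 the bound |p|^2 <= 2 m E = 6 excludes |p| = 3.
print(deformed_shape(3.0, 2))
```

The deviation from the Gaussian shape shrinks as $N$ grows, while for small $N$ the cutoff at $|p_1|^2 = 2mE$ is clearly visible.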

8. Configurational density of classical gases 

The configurational density of a gas of classical particles always belongs to the q-exponential
family [26]. The idea behind the calculation is the same as that of the previous section. This
kind of non-extensivity of a classical gas was observed already quite some time ago [27, 28]. It
becomes a strong statement in the language of q-exponential families.

A classical gas of N interacting particles is described by the Hamiltonian

$$ H(\mathbf{q}, \mathbf{p}) = \frac{1}{2m} \sum_{j=1}^N |p_j|^2 + V(\mathbf{q}), \tag{32} $$

where $V(\mathbf{q})$ is the potential energy due to interaction among the particles and between the
particles and the walls of the system. The microcanonical ensemble is then by definition the
following singular probability distribution

$$ f_U(\mathbf{q}, \mathbf{p}) = \frac{1}{\omega(U)} \delta(U - H(\mathbf{q}, \mathbf{p})), \tag{33} $$

where $\delta(\cdot)$ is Dirac's delta function and $\omega(U)$ is the normalization

$$ \omega(U) = \frac{1}{h^{3N}} \int_{\mathbb{R}^{3N}} dp_1 \cdots dp_N \int_{\mathbb{R}^{3N}} dq_1 \cdots dq_N\, \delta(U - H(\mathbf{q}, \mathbf{p})). \tag{34} $$

The constant $h$ is introduced for dimensional reasons. Because of the quadratic nature of the
kinetic energy term it is possible to integrate out the momenta $p_j$. The resulting function is
called the configurational density function and is evaluated as follows



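$$ \begin{aligned} f_U^{\rm conf}(\mathbf{q}) &= \frac{1}{h^{3N}} \int_{\mathbb{R}^{3N}} dp_1 \cdots dp_N\, f_U(\mathbf{q}, \mathbf{p}) \\ &= \frac{1}{h^{3N}} \frac{1}{\omega(U)} \int_{\mathbb{R}^{3N}} dp_1 \cdots dp_N\, \delta(U - H(\mathbf{q}, \mathbf{p})) \\ &= \frac{1}{h^{3N} \omega(U)} \frac{\partial}{\partial U} \int_{\mathbb{R}^{3N}} dp_1 \cdots dp_N\, \Theta\left( U - V(\mathbf{q}) - \frac{1}{2m} \sum_{j=1}^N |p_j|^2 \right) \\ &= \frac{1}{h^{3N} \omega(U)} \frac{\partial}{\partial U} (2m)^{3N/2} B_{3N}(1) \left[ U - V(\mathbf{q}) \right]_+^{3N/2} \\ &= c_N \exp_q\left( -\alpha(\theta) - \theta V(\mathbf{q}) \right), \end{aligned} \tag{35} $$

with

$$ c_N = \left( \frac{2\pi m}{h^2} \right)^{3N/2}, \qquad \theta = \frac{1}{1-q} \frac{1}{\left[ \Gamma(3N/2)\, \omega(U) \right]^{1-q}}, \qquad \alpha(\theta) = \frac{3}{2} N - 1 - \theta U, \qquad q = 1 - \frac{2}{3N-2}. \tag{36} $$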

Assume now that $\omega(U)$ is a strictly increasing function. Then it can be inverted to obtain
$U$ as a function of $\theta$. Under this assumption the configurational density function belongs to
the q-exponential family. As a consequence, one immediately knows that $f_U^{\rm conf}(\mathbf{q})$ maximizes
the entropy functional $I_q(f)$. It is then tempting to identify $I_q(f_U^{\rm conf})$ with the configurational
entropy. However, as noted before, $\xi(I_q(f_U^{\rm conf}))$ could serve as well, where $\xi(u)$ is an arbitrary
monotonic function.
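
That the power-law form of the configurational density coincides with a q-exponential can be verified numerically for given values of $U$, $V(\mathbf{q})$ and $\omega(U)$. The following is a minimal sketch, assuming the constants $c_N = (2\pi m/h^2)^{3N/2}$, $\theta = [\Gamma(3N/2)\,\omega(U)]^{-(1-q)}/(1-q)$, $\alpha = \frac{3}{2}N - 1 - \theta U$ and $q = 1 - 2/(3N-2)$; the numerical inputs, including the value used for $\omega(U)$, are arbitrary placeholders rather than results for a specific gas.

```python
import math

def exp_q(u, q):
    """q-deformed exponential (15), extended by 0 and +infinity."""
    if q == 1:
        return math.exp(u)
    base = 1 + (1 - q) * u
    if base <= 0:
        return 0.0 if q < 1 else math.inf
    return base ** (1 / (1 - q))

def conf_density_direct(v, u_total, omega, n, m=1.0, h=1.0):
    """Power-law form obtained after integrating out the momenta."""
    k = 1.5 * n
    prefactor = (2 * math.pi * m / h ** 2) ** k
    return prefactor * max(u_total - v, 0.0) ** (k - 1) / (math.gamma(k) * omega)

def conf_density_q_exponential(v, u_total, omega, n, m=1.0, h=1.0):
    """Same density written as c_N * exp_q(-alpha - theta * V)."""
    k = 1.5 * n
    q = 1 - 2 / (3 * n - 2)
    c_n = (2 * math.pi * m / h ** 2) ** k
    theta = (math.gamma(k) * omega) ** (-(1 - q)) / (1 - q)
    alpha = k - 1 - theta * u_total
    return c_n * exp_q(-alpha - theta * v, q)

# Placeholder inputs: potential energy, total energy, density of states, N.
for n in (2, 5):
    a = conf_density_direct(1.2, 3.0, 0.7, n)
    b = conf_density_q_exponential(1.2, 3.0, 0.7, n)
    print(n, a, b)
```

Both functions also return 0 when $V(\mathbf{q}) > U$, which reflects the cutoff built into the q-exponential for $q < 1$.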



9. On the definition of temperature 

There is no consensus in the literature on the correct definition of the thermodynamic
entropy $S(U)$ for isolated systems. The question is of importance because it directly determines
the definition of the thermodynamic temperature via the relation (10). Most often one quotes
the definition known as Boltzmann's entropy

$$ S(U) = k_B \ln \omega(U), \tag{37} $$

where $\omega(U)$ is the density of states (34). However, this choice of definition of entropy has some
drawbacks. For instance, for the pendulum the entropy $S(U)$ as a function of internal energy
$U$ is a piecewise convex function instead of a concave function [2]. The lack of concavity can
be interpreted as a microcanonical instability [29, 30]. But there is no physical reason why the
pendulum should be classified as being unstable at all energies.

The shortcomings of Boltzmann's entropy have been noticed long ago. A slightly different
definition of entropy is [28, 31, 32, 33] (see also in [31] the reference to the work of A. Schlüter)

$$ S(U) = k_B \ln \Omega(U), \tag{38} $$

where $\Omega(U)$ is the integral of $\omega(U)$ and is given by

$$ \Omega(U) = \frac{1}{h^{3N}} \int_{\mathbb{R}^{3N}} dp_1 \cdots dp_N \int_{\mathbb{R}^{3N}} dq_1 \cdots dq_N\, \Theta(U - H(\mathbf{q}, \mathbf{p})). \tag{39} $$

An immediate advantage of (38) is that the resulting expression for the temperature $T$, defined
by the thermodynamical formula

$$ \frac{1}{T} = \frac{dS}{dU}, \tag{40} $$

coincides with the notion of temperature as used by experimentalists. Indeed, one finds

$$ k_B T = \frac{\Omega(U)}{\omega(U)}. \tag{41} $$

For a harmonic oscillator the density of states $\omega(U)$ is a constant. Hence, (41) implies $k_B T = U$,
as wanted. It is well known that for classical monoatomic gases the r.h.s. of (41) coincides with
twice the average kinetic energy per degree of freedom. Its significance is that the equipartition
theorem, assigning $(1/2) k_B T$ to each degree of freedom, does hold for the kinetic energy also in
the microcanonical ensemble. Quite often the average kinetic energy per degree of freedom is
experimentally accessible and provides a unique way to measure accurately the temperature of
the system.
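
The difference between the two entropy definitions can be made concrete for harmonic oscillators, where both $\Omega(U)$ and $\omega(U)$ are known in closed form. The sketch below assumes $n$ one-dimensional oscillators with $\Omega(U) = (U/\hbar\omega)^n / n!$ and units in which $k_B = 1$ and $\hbar\omega = 1$; the function names are illustrative.

```python
import math

def phase_volume(u, n_osc, hbar_omega=1.0):
    """Omega(U) for n_osc harmonic oscillators: (U / hbar*omega)^n / n!."""
    return (u / hbar_omega) ** n_osc / math.factorial(n_osc)

def density_of_states(u, n_osc, hbar_omega=1.0):
    """omega(U) = dOmega/dU = U^(n-1) / ((n-1)! (hbar*omega)^n)."""
    return u ** (n_osc - 1) / (math.factorial(n_osc - 1) * hbar_omega ** n_osc)

def kt_from_volume(u, n_osc):
    """Temperature from S = ln Omega(U): k_B T = Omega(U) / omega(U)."""
    return phase_volume(u, n_osc) / density_of_states(u, n_osc)

def kt_boltzmann(u, n_osc, step=1e-6):
    """Temperature from S = ln omega(U), using a central finite difference."""
    slope = (math.log(density_of_states(u + step, n_osc))
             - math.log(density_of_states(u - step, n_osc))) / (2 * step)
    return math.inf if slope == 0 else 1 / slope

print(kt_from_volume(3.0, 1), kt_boltzmann(3.0, 1))    # single oscillator
print(kt_from_volume(12.0, 6), kt_boltzmann(12.0, 6))  # six oscillators
```

For a single oscillator the volume-based definition gives $k_B T = U$, whereas the density of states is constant and Boltzmann's entropy assigns no finite temperature; for $n$ oscillators the two definitions give $U/n$ and $U/(n-1)$ respectively.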

But (38) and (41) are also subject to criticism. In small systems finite size corrections appear
[34, 35] for a number of reasons. As argued in [35], the problem is not the equipartition of the
kinetic energy over the various degrees of freedom, but the relation between temperature and
kinetic energy.



10. Example 

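Let us follow [25] and consider the example of 3N non-interacting harmonic oscillators. The
potential energy equals

$$ V(\mathbf{q}) = \frac{1}{2} m\omega^2 \sum_{j=1}^{3N} q_j^2. \tag{42} $$

One calculates

$$ \Omega(U) = \frac{1}{h^{3N}} \int_{\mathbb{R}^{3N}} dp_1 \cdots dp_{3N} \int_{\mathbb{R}^{3N}} dq_1 \cdots dq_{3N}\, \Theta\left( U - \frac{1}{2m} \sum_{j=1}^{3N} p_j^2 - \frac{1}{2} m\omega^2 \sum_{j=1}^{3N} q_j^2 \right) = \frac{1}{(3N)!\, \omega^{3N}} \left( \frac{2\pi U}{h} \right)^{3N}. \tag{43} $$

From (36) and (43) now follows

$$ \frac{1}{\theta} = (1-q) \left[ \frac{\Gamma(3N/2)}{\Gamma(3N)} \frac{1}{\omega^{3N}} \left( \frac{2\pi}{h} \right)^{3N} U^{3N-1} \right]^{1-q}. \tag{44} $$

Using Stirling's approximation one obtains

$$ \frac{\Gamma(3N/2)}{\Gamma(3N)} \simeq \sqrt{2} \left( \frac{e}{6N} \right)^{3N/2}, \tag{45} $$

so that

$$ \frac{1}{\theta} \simeq e \left( \frac{4\pi U}{6N h\omega} \right)^2, \tag{46} $$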



which shows that $1/\theta$ is roughly proportional to $U^2$. This is not what one expects. In the
canonical ensemble the temperature $1/\beta$ is proportional to $U$:

$$ \frac{1}{\beta} = \frac{U}{3N}. \tag{47} $$



One therefore concludes that the parameter $\theta$ cannot be the inverse temperature $\beta$. Note that
$\theta = d\tilde{S}^{\rm conf}/dU^{\rm conf}$ holds with $\tilde{S}^{\rm conf} = I_q(f_U^{\rm conf})$ and $U^{\rm conf} = \mathbb{E}_\theta V = \frac{1}{2} U$. Therefore one concludes
that $\tilde{S}^{\rm conf}$ cannot be the thermodynamic configurational entropy $S^{\rm conf}$, but the two entropies
must be related by a non-trivial function $\xi(u)$. From (47) follows

$$ S^{\rm conf} = \frac{3N}{2} \ln U^{\rm conf} + A \tag{48} $$

for some constant $A$. But (46) implies that

$$ \tilde{S}^{\rm conf} = -\frac{1}{e} \left( \frac{h\omega}{4\pi} \right)^2 \frac{(3N)^2}{U^{\rm conf}} + \text{constant}. \tag{49} $$



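Therefore the relation between $S^{\rm conf}$ and $\tilde{S}^{\rm conf}$ is logarithmic. More precisely, $S^{\rm conf} = \xi(\tilde{S}^{\rm conf})$,
with $\xi(x)$ of the form $\xi(x) = -\frac{3N}{2} \ln(B - x) + C$, with constants $B$ and $C$. This suggests that
Rényi's entropy functional is the right one to start with. Indeed, let $\alpha = 2 - q$. The relation
between Rényi's $I_\alpha(f)$ and $I_q(f)$, as given by (19), is $I_\alpha(f) = \xi(I_q(f))$ with

$$ \xi(u) = \frac{1}{q-1} \ln\left( 1 - (1-q)u \right). \tag{50} $$

Take the constant in (49) equal to $1/(1-q)$. Then one obtains

$$ S^{\rm conf} = \xi(\tilde{S}^{\rm conf}) = \left( \frac{3N}{2} - 1 \right) \ln U^{\rm conf} + \text{constant}. \tag{51} $$

This yields

$$ \frac{dS^{\rm conf}}{dU^{\rm conf}} = \left( \frac{3N}{2} - 1 \right) \frac{1}{U^{\rm conf}} \simeq \frac{3N}{U}, \tag{52} $$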



which is an acceptable expression for the inverse temperature $\beta$.
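
The steps of this example can be checked numerically. The sketch below is an illustration under stated assumptions: it takes the map between the two functionals to be $\xi(u) = \ln(1 - (1-q)u)/(q-1)$ with $\alpha = 2 - q$, uses a discrete distribution with $c = 1$ for the comparison with Rényi's entropy, compares Stirling's approximation $\sqrt{2}\,(e/6N)^{3N/2}$ with the exact Gamma-function ratio on a logarithmic scale, and replaces the prefactor of $1/U^{\rm conf}$ by a placeholder constant `k_const`.

```python
import math

def xi(u, q):
    """Map from the q-deformed entropy to Renyi's entropy with alpha = 2 - q."""
    return math.log(1 - (1 - q) * u) / (q - 1)

def entropy_q(probs, q):
    """Discrete analogue of I_q(f) with c = 1, for q != 1."""
    return (1 - sum(p ** (2 - q) for p in probs)) / (1 - q)

def renyi_entropy(probs, alpha):
    """Renyi's entropy ln(sum_i p_i^alpha) / (1 - alpha), for alpha != 1."""
    return math.log(sum(p ** alpha for p in probs)) / (1 - alpha)

def log_gamma_ratio_exact(n):
    """ln of Gamma(3N/2) / Gamma(3N)."""
    return math.lgamma(1.5 * n) - math.lgamma(3 * n)

def log_gamma_ratio_stirling(n):
    """ln of sqrt(2) * (e / 6N)^(3N/2)."""
    return 0.5 * math.log(2) + 1.5 * n * (1 - math.log(6 * n))

def renyi_conf_entropy(u_conf, n, k_const=1.0):
    """xi applied to B - k_const / U_conf with B = 1/(1-q).

    k_const is a placeholder for the prefactor of 1/U_conf.
    """
    q = 1 - 2 / (3 * n - 2)
    s_tilde = 1 / (1 - q) - k_const / u_conf
    return xi(s_tilde, q)

probs, q = [0.5, 0.3, 0.2], 0.5          # hypothetical distribution
print(xi(entropy_q(probs, q), q), renyi_entropy(probs, 2 - q))

n, u_conf, step = 10, 2.0, 1e-6          # illustrative values
print(log_gamma_ratio_exact(n) - log_gamma_ratio_stirling(n))

slope = (renyi_conf_entropy(u_conf + step, n)
         - renyi_conf_entropy(u_conf - step, n)) / (2 * step)
print(slope, (1.5 * n - 1) / u_conf)     # inverse temperature vs (3N/2 - 1)/U_conf
```

The numerical derivative reproduces $(3N/2 - 1)/U^{\rm conf}$ independently of the placeholder constant, which is the behaviour expected of an inverse temperature for large $N$.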



11. Concluding remarks 

The notion of a q-exponential family of statistical models has been reviewed. The q-Gaussian
distribution belongs to this family. The probability distribution of the momentum of a single
particle in a classical gas is such a q-Gaussian. An immediate consequence is that the velocity
distribution of a classical particle is a q-deformed Maxwell distribution. In particular, the
standard Maxwell distribution is an approximation which is only valid in the limit of large
systems.

The configurational probability distribution of a classical gas always belongs to the
q-exponential family. The non-extensivity parameter $q$ is given by

$$ \frac{1}{1-q} = \frac{3}{2} N - 1, \tag{53} $$

where $N$ is the number of particles. The latter expression has appeared quite often in the
literature, see for instance [2?], [36], [37].

Finally, a system of 3N independent harmonic oscillators is considered. It is known that
Tsallis' entropy and Rényi's entropy are equivalent in the context of the maximal entropy
principle. But they make a difference from a thermodynamical point of view. The example
suggests that the correct thermodynamical definition of temperature requires the use of Rényi's
entropy. This observation has been made before in a different setting [28, 33].